Search CORE

75 research outputs found

A Convex Relaxation for Weakly Supervised Classifiers

Author: Bach Francis
Joulin Armand
Publication venue
Publication date: 27/06/2012
Field of study

This paper introduces a general multi-class approach to weakly supervised classification. Inferring the labels and learning the parameters of the model is usually done jointly through a block-coordinate descent algorithm such as expectation-maximization (EM), which may lead to local minima. To avoid this problem, we propose a cost function based on a convex relaxation of the soft-max loss. We then propose an algorithm specifically designed to efficiently solve the corresponding semidefinite program (SDP). Empirically, our method compares favorably to standard ones on different datasets for multiple instance learning and semi-supervised learning as well as on clustering tasks.Comment: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012

arXiv.org e-Print Archive

CiteSeerX

INRIA a CCSD electronic archive server

Deep Fragment Embeddings for Bidirectional Image Sentence Mapping

Author: Fei-Fei Li
Joulin Armand
Karpathy Andrej
Publication venue
Publication date: 22/06/2014
Field of study

We introduce a model for bidirectional retrieval of images and sentences through a multi-modal embedding of visual and natural language data. Unlike previous models that directly map images or sentences into a common embedding space, our model works on a finer level and embeds fragments of images (objects) and fragments of sentences (typed dependency tree relations) into a common space. In addition to a ranking objective seen in previous work, this allows us to add a new fragment alignment objective that learns to directly associate these fragments across modalities. Extensive experimental evaluation shows that reasoning on both the global level of images and sentences and the finer level of their respective fragments significantly improves performance on image-sentence retrieval tasks. Additionally, our model provides interpretable predictions since the inferred inter-modal fragment alignment is explicit

arXiv.org e-Print Archive

CiteSeerX

Bag of Tricks for Efficient Text Classification

Author: Bojanowski Piotr
Grave Edouard
Joulin Armand
Mikolov Tomas
Publication venue
Publication date: 09/08/2016
Field of study

This paper explores a simple and efficient baseline for text classification. Our experiments show that our fast text classifier fastText is often on par with deep learning classifiers in terms of accuracy, and many orders of magnitude faster for training and evaluation. We can train fastText on more than one billion words in less than ten minutes using a standard multicore~CPU, and classify half a million sentences among~312K classes in less than a minute

arXiv.org e-Print Archive

Crossref

Adaptive Attention Span in Transformers

Author: Bojanowski Piotr
Grave Edouard
Joulin Armand
Sukhbaatar Sainbayar
Publication venue
Publication date: 01/01/2019
Field of study

We propose a novel self-attention mechanism that can learn its optimal attention span. This allows us to extend significantly the maximum context size used in Transformer, while maintaining control over their memory footprint and computational time. We show the effectiveness of our approach on the task of character level language modeling, where we achieve state-of-the-art performances on text8 and enwiki8 by using a maximum context of 8k characters.Comment: Accepted to ACL 201

arXiv.org e-Print Archive

Crossref